Improving NLP through Marginalization of Hidden Syntactic Structure

Authors

  • Jason Naradowsky
  • Sebastian Riedel
  • David A. Smith
Abstract

Many NLP tasks make predictions that are inherently coupled to syntactic relations, but for many languages the resources required to provide such syntactic annotations are unavailable. For others it is unclear exactly how much of the syntactic annotations can be effectively leveraged with current models, and what structures in the syntactic trees are most relevant to the current task. We propose a novel method which avoids the need for any syntactically annotated data when predicting a related NLP task. Our method couples latent syntactic representations, constrained to form valid dependency graphs or constituency parses, with the prediction task via specialized factors in a Markov random field. At both training and test time we marginalize over this hidden structure, learning the optimal latent representations for the problem. Results show that this approach provides significant gains over a syntactically uninformed baseline, outperforming models that observe syntax on an English relation extraction task, and performing comparably to them in semantic role labeling.
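The idea of marginalizing over hidden syntactic structure can be illustrated with a toy sketch. The abstract does not give the inference procedure, so the code below swaps in brute-force enumeration of all single-rooted dependency trees for a three-word sentence, and sums them out to obtain a distribution over task labels. The function names, the labels "REL" and "NONE", and the scoring function `toy_score` are all hypothetical and chosen only for illustration; this is not the authors' model.

```python
import itertools
import math


def is_tree(heads):
    """heads[i] is the head of word i+1; 0 denotes an artificial root node."""
    if sum(1 for h in heads if h == 0) != 1:  # require exactly one root word
        return False
    for start in range(1, len(heads) + 1):
        seen = set()
        node = start
        while node != 0:  # follow head pointers up to the root
            if node in seen:  # revisiting a word means there is a cycle
                return False
            seen.add(node)
            node = heads[node - 1]
    return True


def enumerate_trees(n):
    """All valid single-rooted dependency trees over n words (brute force)."""
    candidates = itertools.product(range(n + 1), repeat=n)
    return [h for h in candidates if is_tree(h)]


def log_sum_exp(values):
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))


def marginal_label_probs(labels, trees, score):
    """p(y) is proportional to the sum over latent trees z of exp(score(y, z))."""
    log_joint = {y: log_sum_exp([score(y, z) for z in trees]) for y in labels}
    log_z = log_sum_exp(list(log_joint.values()))
    return {y: math.exp(v - log_z) for y, v in log_joint.items()}


def toy_score(label, heads):
    # Hypothetical coupling: the label "REL" is favoured when word 3 attaches to word 1.
    return 1.0 if label == "REL" and heads[2] == 1 else 0.0


trees = enumerate_trees(3)
probs = marginal_label_probs(["REL", "NONE"], trees, toy_score)
```

Enumeration is only feasible for very short sentences, since the number of trees grows exponentially with sentence length; a practical system would need an efficient inference method in place of the explicit sum.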


Similar resources

Factors Affecting the Development of Marginalization and its Social Consequences in Birjand

An outbreak of security, hygiene, and … problems has led the city managers of Birjand to recognize the existence of marginalization within the city, and to seek to identify how it develops and is organized, especially its social consequences. The research is descriptive and applied, and aims to identify the factors affecting formi...


ACL-COLING 1998, Montreal, Canada, 491-497, 1998. Improving Data Driven

In this paper we examine how the differences in modelling between different data driven systems performing the same NLP task can be exploited to yield a higher accuracy than the best individual system. We do this by means of an experiment involving the task of morpho-syntactic wordclass tagging. Four well-known tagger generators (Hidden Markov Model, Memory-Based, Transformation Rules and Maximum...



Word Representations, Tree Models and Syntactic Functions

Word representations induced from models with discrete latent variables (e.g. HMMs) have been shown to be beneficial in many NLP applications. In this work, we exploit labeled syntactic dependency trees and formalize the induction problem as unsupervised learning of tree-structured hidden Markov models. Syntactic functions are used as additional observed variables in the model, influencing both...


Improving Accuracy in Wordclass Tagging through Combination of Machine Learning Systems

We examine how differences in language models, learned by different data driven systems performing the same NLP task, can be exploited to yield a higher accuracy than the best individual system. We do this by means of experiments involving the task of morpho-syntactic wordclass tagging, on the basis of three different tagged corpora. Four well-known tagger generators (Hidden Markov Model, Memor...




Publication date: 2012